Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added prototypes of Data sampler for CNNs and Two versions of Mini-Unet #201

Draft
wants to merge 16 commits into
base: master
Choose a base branch
from

Conversation

iiroseppa
Copy link
Collaborator

@iiroseppa iiroseppa commented Oct 11, 2023

I propose to add two versions of Mini-Unet and associated data sampler.

  • I have working prototypes, which need further developing. However, I want to submit them now because the project deadline for new algorithms is arriving. The core of the algorithm is finished, however there is a lot things that need implementation before this is ready to be published.

TEST DATA

https://seafile.utu.fi/d/3aaf213912104cb28223/

Mini U-net with learned uncertainty estimation

  • Semantic segmentation for (small) images, with included learned uncertainty estimation
    • There is separate script for semantic segmentation without uncertainty estimation
  • The idea is based on
  • This implementation takes strong inspiration from
    • Wu, S., Heitzler, M. & Hurni, L. (2022) ‘Leveraging uncertainty estimation and spatial pyramid pooling for extracting hydrological features from scanned historical topographic maps’, GIScience & Remote Sensing, 59(1), pp. 200–214. Available at: https://doi.org/10.1080/15481603.2021.2023840.
  • Loss based on (Sørensen -)Dice coefficient is utilized due to favorable properties related to unbalanced pixel counts.
    • Milletari, F., Navab, N. & Ahmadi, S.-A. (2016) ‘V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation’, in 2016 Fourth International Conference on 3D Vision (3DV). 2016 Fourth International Conference on 3D Vision (3DV), pp. 565–571. Available at: https://doi.org/10.1109/3DV.2016.79.

To summarise the main idea

  • In addition to segmentation, the model is asked to create certainty estimation
  • When overall certainty is high (close to 1) the loss function operates as if no certainty estimate was used
  • When certainty values dip below 1, the model is punished less for making mistakes at those pixels.
    • When certainty is close to zero at certain pixel, the model segmentation output does not matter at all for the loss at that pixel
  • To prevent model predicting high uncertainty everywhere, it is punished for being uncertain at large areas
    • This is done by averaging logarithm of the uncertainties at each pixel * -1 together
    • This regularization loss is multiplied with a hyperparameter called uncertainty coefficient
    • The value of the uncertainty coefficient has a large effect on the results, and finding suitable value should be automatised (not yet implemented)
      • Too small values result in poor segmentation results
      • Too large values result in abolute certainty everywhere
      • Suitable value creates resulting values between full range of zero and one for both segmentation prediction and certainty estimate
      • Thus closer (max cert - min cert) + (max pred - min pred) is to to 2, the better the uncertainty coefficient. This could be used as a basis for optimisation

Notes

  • Current implementation is experimental and needs further optimization, testing and functionality
    • IMPORTANT The possibility to input a large geotiff should be implemented. In that case the trained model should be used to predict the whole area in a sliding window type of setup. Then it should be parsed back together to create output prediction and uncertainty estimate geotiffs for the whole area. This tool should not be included in the toolkit before that can be done.
    • hyperparameter tuning should be automatized
    • Dice coefficient and Intersection over Union should be implemented as metric to measure the goodness of model
      • custom metrics are needed due to the unusual shape of the output
    • Data suffling should maybe moved to data sampling side
      • Leave one out cross validation should also be implemented and integrated with data sampling, so that user has reliable tools to estimate possible (likely) overfitting
      • current implementation with data sampler does not create reliable test data, because one occurence features multiple times in data with different croppings.
      • The other option is to take out all the times one of the features occurs before suffling and keep data sampling as is
  • The current implementation only works with binary classes (0 and 1)
  • Image size should be at least 32 x 32 pixels or larger power of 2 (or at least divisible three times with 2, after which size should be at least 3 x 3)
  • Using either data from automatic sampling function or user's own data should be given as an option
  • The model needs to be trained long enough so that it produces both prediction and uncertainty, even if the uncertainty punishment coefficient is correct. The balancing is quite intricate and highlights the need for automatic hyperparametrisation

Mini U-net

  • Semantic segmentation for (small) images, with included learned uncertainty estimation
    • There is separate script for semantic segmentation with learned uncertainty estimation
  • Loss based on (Sørensen -)Dice coefficient is utilized due to favorable properties related to unbalanced pixel counts.
    • Milletari, F., Navab, N. & Ahmadi, S.-A. (2016) ‘V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation’, in 2016 Fourth International Conference on 3D Vision (3DV). 2016 Fourth International Conference on 3D Vision (3DV), pp. 565–571. Available at: https://doi.org/10.1109/3DV.2016.79.
  • This implementation also takes inspiration from
    • Wu, S., Heitzler, M. & Hurni, L. (2022) ‘Leveraging uncertainty estimation and spatial pyramid pooling for extracting hydrological features from scanned historical topographic maps’, GIScience & Remote Sensing, 59(1), pp. 200–214. Available at: https://doi.org/10.1080/15481603.2021.2023840.

The main idea

  • Learns to segment areas of interest with fewer parameters than normal U-net, which reduces the need for training data and also the risk for overfitting

Notes

  • Current implementation is experimental and needs further optimization, testing and functionality

    • IMPORTANT The possibility to input a large geotiff in addition to training and testing data should be implemented. In that case the trained model wouldbe used to predict the whole area in a sliding window type of setup. Then it should be parsed back together to create output prediction and uncertainty estimate geotiffs for the whole area. This tool should not be included in the toolkit before that can be done.
    • For example, hyperparameter tuning should be automatized
    • Dice coefficient and Intersection over Union should be implemented as metric to measure the goodness of model
      • custom metrics are needed due to the unusual shape of the output
    • Data suffling should maybe moved to data sampling side
      • Leave one out cross validation should also be implemented and integrated with data sampling, so that user has reliable tools to estimate possible (likely) overfitting
      • current implementation with data sampler does not create reliable test data, because one occurence features multiple times in data with different croppings.
      • The other option is to take out all the times one of the features occurs before suffling and keep data sampling as is
  • The current implementation only works with binary classes (0 and 1)

  • Image size should be at least 32 x 32 pixels or larger power of 2 (or at least divisible three times with 2, after which size should be at least 3 x 3)

  • Using either data from automatic sampling function or user's own data should be given as an option

Data sampler

Creates a set of images that can be used for training a convolutional neural network.

Input

  • Geotiff that will be sampled
  • Bounds of the geotiff, geopackage or other geospatial vector format
    • TODO: Just extract the extent from geotiff nodata mask
  • locations of the wanted class, geopackage or other geospatial vector format

Output

  • Two folders with geotiff images inside, one containing images and the other containing labels

Internal workings

  • The known occurrences are oversampled and negative samples are selected randomly.
    • This ensures that both sample types are represented equally in the created dataset.
  • The sampler works well with sparse datasets, where it is relatively rare that the negative samples would contain an unkown occurence
    • Thus there shouldn't be too much misclassifications in the data that would cause problem
  • User has the option to set buffer size for the known occurences, because one pixel locations are harder to locate for model than .

TODO

  • Current implementation creates images in order, known occurences first and non-occurences last. This makes suffling necessary at the model script, which complicates things. Suffling should be moved to this script
  • This sampling script should be integrated to the cnn scripts so that leave one out cross validation would be easy to do
    • The leave one out method should be implented in a way that one feature is completely removed from the occurence data before any sampling is done, and then test data is creaed by sampling the left out feature alone
    • without this it is hard to assess level of overfitting
    • users should be given a clear warning about htis, if the script is published without these changes. Overfitting is nearly unavoidable with low amounts of data so user should be given tools to asess the level of overfitting

@iiroseppa iiroseppa linked an issue Oct 11, 2023 that may be closed by this pull request
@zelioluca
Copy link
Collaborator

hello there so I prepared the iro script! Now I will prepare some test

Copy link
Collaborator

@zelioluca zelioluca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can you help me to delete the .idea folder?

@@ -72,3 +72,11 @@ class NonSquarePixelSizeException(Exception):

class NumericValueSignException(Exception):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

perfect! I change it now!

@zelioluca zelioluca self-assigned this Jan 17, 2024
@nmaarnio nmaarnio mentioned this pull request Apr 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add Mini-Unet for segmentation
4 participants